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Orderliness, reflected via mathematical laws, is encountered in different frameworks involving social 
groups. Here we show that a thermodynamics can be constructed that macroscopically describes 
urban population flows. Microscopic dynamic equations and simulations with random walkers un- 
derlie the macroscopic approach. Our results might be regarded, via suitable analogies, as a step 
towards building an explicit social thermodynamics. 



I. INTRODUCTION 

The application of mathematical models to social sci- 
ences has a long and distinguished history-^. One may 
speak of empirical data from scientific collaboration net- 
works-, cites of physics journals 3 -, the Internet traffic 4 , 
Linux packages links^, popularity of chess openings 6 -, as 
well as electoral results-'-, urban agglomerations*^ and 
firm sizes all over the worldi 1 -. A specially relevant issue 
is that of universality classes defined by to the so-called 
Zipf's law (ZL) in the cumulative distribution or rank- 
size distributions £r££r2^. Maillart et al£ have found 
that links' distributions follow ZL as a consequence of 
stochastic proportional growth. Such kind of growth as- 
sumes that an element of the system becomes enlarged 
proportionally to its size fc, being governed by a Wiener 
process. The class emerges from a condition of stationar- 
ity (dynamic equilibrium) 1 ^. ZL also applies for processes 
involving either self-similarity 6 - or fractal hierarchy 1 ^, all 
of them mere examples amongst very general stochastic 
ones 1 ^. A second universality-class was found by Costa 
Filho et alii, who studied vote-distributions in Brazil's 
electoral results. Therefrom emerge multiplicative pro- 
cesses in complex networks^. Such behavior ensues as 
well in i) city-population rank distributions 16 , ii) Span- 
ish electoral results^ 6 -, and iii) the degree distribution of 
social networks 1 ^. As shown in Ref.—, this universality 
class encompasses Benford's Lawi£. In the present vein, 
still another kind of idiosyncratic distribution is often re- 
ported: the log-normal one 2 ^, that has been observed in 
biology (length and sizes of living tissue^), finance (in 
particular, the Black and Scholes model^ 2 -), and firms- 
sizes. The latter instance obeys Gibrat's rule of propor- 
tionate growth^ 3 -, that also applies to cities' sizes. 

Together with geometric Brownian motion, there is a 
variety of models arising in different fields that yield 
Zipf's law and other power laws on a case-by-case ba- 
sis2 ] 10 i 15 i 24 ~ ^£, as preferential attachment 4 and compet- 
itive cluster growth-*^ in complex networks, used to 
explain many of the scale-free properties of social net- 
works. For instance, we may mention detailed realistic 
approaches in urban modellin g 2 ' 28 , opinion dynamics^, 
and electoral results^^ -. Of course, the renormalization 
group is intimately related to scale invariance and asso- 



ciated techniques have been fruitfully exploited in these 
matters (as a small sample see 12611311 ). 

It has been recently shown, in Ref. [32], that a variational 
principle based on MaxEnt can be successfully applied to 
scale-invariant social systems. Used in the present con- 
text, it allows for a classification of the above cited be- 
haviors on the basis of inferences drawn from objective 
observables of the system. We had also shown 3 - 3 , that 
including some dynamical information in the variational 
scheme 3 -^ one is able to reproduce the shape of empiri- 
cal city-population distributions, going beyond the cus- 
tomary universality classes conventionally used in such 
regards. Indeed, a connection between explicit micro- 
scopic growth equations and the macroscopic character- 
ization exists, illustrated for logistic-growth in Ref. |H. 
We will here describe the manner in which the methods 
of that paper can be generalized to first-principles theo- 
retical framwework describing population flows in terms 
of thermodynamic concepts. 



A. Motivation, statement of the problem and goal 



We are looking here for more that models: what we aim 
for is to discover physical principles that may underlie 
some social phenomena. Our system is a specific geo- 
graphical area whose population is distributed amongst 
several population-nuclei (cities, villages, towns, etc.) 
Each nucleus' population is time-dependent due to mi- 
gration, birth, death, etc. Our aim is to quantitatively 
describe the population-nuclei's variation. Microscopic 
variables are plentiful, but our main goal is to be able to 
identify macroscopic variables that can give a reasonable 
account of urban population-variations. 

We will proceed in seven steps, as indicated in the scheme 
below: 
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1. Introduce the basic observables and the empirical 
data sets. 

2. Identify the stochastic nature of the city- 
population growth rates. 

3. Postulate dynamic microscopic equations and em- 
pirically validate them. 

4. Perform numerical simulations with random 
walkers following these dynamical equations and 
parametrize the macroscopic evolution. 

5. Show that equilibrium configurations of such evo- 
lutions can be predicted by MaxEnt using few 
macroscopic parameters. 

6. Derive thermodynamic-like relations between 
these macro-parameters. 

7. Show the applicability of our thermodynamic de- 
scription by modeling empirical urban flows as an 
scale invariant ideal gas. 



The paper is organized as follows. Step 1 is addressed in 



the next Section II. Section III deals with step 2, Section 
IV with step 3, Section V with step 4, and Section VI 
with step 5 and 6. Finally, the application is dealt with 
in Section VII, and some conclusions are drawn in Section 
VIII. 



II. PRELIMINARY MATTERS 

The basic ingredients we need in our approach, following 
Refs. lalHl are 

i) n, the total number of "population-nuclei" ; 

ii) Xi(t), the population of the i-th nucleus at time t 
(and x(t) = {xi(t)}f =1 a vector with all the popu- 
lations); 

iii) Xq and xm, the minimum and maximum allowed 
nucleus' population (in general xq = 1 and xm — 

oo); 

iv) Nt, the total area's population (Nt = J27=i x » W)> 

v) Xi(t), the time-derivative of Xi(t) (thus the pairs 
{(xi, ii)}" =1 compose the "urban phase space"); 
and 

v) some a priori knowledge of the dynamics at hand, 
written as 

± i {t) = ki{t)g i [yL{t)] (1) 

where gi are population-functions to be determined 
and ki(t) growth rates independent of the g,. 



The raw data used in our analysis is obtained from the 
Spanish state institute INE 36 and cover annually the pe- 
riod 1996-2010 (with the exception of 1997). It encom- 
passes up to 8000 municipalities (the smallest Spanish 
administrative unit) distributed within 50 provinces (the 
building blocks of the autonomous communities). We 
use provinces and municipalities as the closest represen- 
tatives of the ideal of a closed system's fundamental ele- 
ments. Also other regions of the world are used as exam- 
ples along the text. In this tableau, the total population 
Nt of a province is apportioned in n nucleus. The i-th 
nucleus account a population of xq < Xi (t) < xm at time 
t, which time-evolution obeys Eq. ([l}. 



III. THE STOCHASTIC NATURE OF 
POPULATION GROWTH RATES 

We begin dealing with step 2 of our Scheme, saying 
something meaningful concerning the form of the growth 
rates ki(t) in Eq. ([1}. The value of fej above de- 
pends upon millions of individual decisions, so it is ex- 
pected some stochastic behavior. We should know both 
the average m, = (ii(t))st and the standard deviation 
Si = ((xi(t) — fni) 2 )\[ 2 (for each i) in a time-window 
5t around t. Trying then to study the distribution of 
= (ii(t) — mi)/ Si one immediately finds 

mi = (ki(t)gi(t))st 

= (ki(t))st x (ft(t))at, (2) 
*i = ((h(t)gi(t)-(ki(t)gi(t)) 5t ) 2 ) st 

= <<&(*)>*-<**(*)>«<£ ( 3 ) 

with (Tfc i , a gi being the standard deviations of hi (t) and 
g%(t) = </i[x(t)], respectively. Assuming now that the 
function g^s variation in the time- window for which one 
evaluates the pair m, - s, is negligible (i.e., <jg <C cj^), 
to a good approximation one has 

entailing that £j(t) has null average and unit standard 
deviation. If this assumption is correct the shape of the 
Pz~ distribution of the variable &(t) should not depend 
upon Xi(t). We have verified the hypothesis, as our first 
result here, with reference to all (8116) Spain's municipal- 
ities. Fig. [T] displays the (xi, pairs for every township 
in the time-window St — 15 years. From them we eval- 
uate appropriate points taken at regular intervals from 
the cumulative distribution function of our random vari- 
able (quantiles) as a function of the population x. No 
apparent x— dependence can be detected. The overall 
distribution ps, (£) shape looks like a normal one 




3 



with cumulative distributions of the form 



1 + erf 



(G) 



shown in Fig. [TJ Save for some fluctuations, we have 
not found any dependence on the shape of p~ (£) for the 
different provinces (same Fig. [1]). Accordingly, but with 
a grain of salt, one may speak of "universality" . Con- 
sequently, we will consider herefrom that our variable £ 
can be regarded as belonging to a Wiener process, our 
second result here. 



IV. INTRODUCING MICROSCOPIC 
EQUATIONS OF MOTION 

A. Proportional growth 

We are now at step 3. For the g^s shape we will as- 
sume that it depends only on its own Xi's population, 
i.e., <7i[x(i)] ~ gi[xi{t)\. In order to guess the explicit 
analytical form we appeal to a cluster-growth model 
in networksj&il used successfully before describing city- 
population distributions. We firstly consider a network 
of nodes (that eventually represents the social network) 
and a single node as seed of a cluster. Initially, the first 
neighbors of the seed will belong to the cluster with a 
given probability P(t = 0). At a subsequent time t, the 
first neighbors of the members of the cluster become also 
members with probability P(t). Proceeding in this vein, 
it is reasonable to conjecture that the time-variation of 
the cluster-size x at time t acquire the form 



x(t) 



(7) 



where Cj (t) is the first-neighbors-number of node i-th at 
time t. We appeal now to the central limit theorem to 
write 



x{t) = P(t) (c(t)x(t) + a c {ty 



(8) 



Here c(t) is the mean neighbor's number at time t, 
a c (t) its standard deviation, and £(t) an independent 
normally-distributed number. This last summand, usu- 
ally neglected for very large sizes, is associated to finite- 
size effects. The first term, size-proportional, generates 
proportional (or multiplicative) growth. In view of this 
result, we consider the form 



9i[Xi 



(9) 



with a = 1 or 1/2. Considering then both terms in the 
microscopic dynamics we write 



±i(t) = kn(t)xi(t) + k ll/2 {t)y / x i (t), 



(10) 



with the kn(t) and kn/2(t) two (a priori) independent 
Wiener coefficients. This dependence is checked out by 



comparison of the previously employed Sj— numbers with 
a functional form of the type 



s 2 {x l ) = {[x,] 2 - (xi) 2 st )st 

2 2,2 
— a il x i T < J il/2 X i' 



(11) 



where an and o~i\/2 ar e the associated deviations of fcji/2 
and kn, respectively. Rewriting (|11[) in a more conve- 
nient way we have 



It \ I _ 2 , 2 

s i (Xi)/Xi — o- a Xi -t- 
that, for sizes small enough reduces to 



(12) 



s 2 (xi)/xi^a 2 1/2 , (13) 



while for very large sizes one has 

(,Xi)/Xi ~ &i\Xi' 



(14) 



The transition between these two regimes should take 
place at a value xt = crf, 2 /a 2 . Fig. [2] displays, as our 
third result, the (xi, s 2 (x)/xi) — pairs for all the Span- 
ish municipalities, together with appropriate quantiles. 
The median rned(si(xi)/xi) nicely fits things with o~n — 
0.0119 and cr.a/2 = 0.47. We appreciate the fact that fi- 
nite size fluctuations are larger than multiplicative ones, 
the later dominating, of course, for large sizes. Our tran- 
sition occurs at population- values of the order of 1500 
inhabitants. Surprisingly enough, the distribution of the 
variable s- = \og[s l (x i ) / y/xl] - log[med(s l (x i )/^/57)] be- 
comes independent of xi , being of a Gaussian nature. 
At this point, we need still to address a further ques- 
tion. The finite-size term average is {kii/ 2 {t))st = (by 
definition), but this is not so for the multiplicative one 
(kn(t))st 7^ 0, that is a priori regarded as constant and 
size-independent. This is indeed empirically true on oc- 
casions, but not always. For instance, such assumption 
cannot account for the migration from the countryside 
to big cities, where the mean growth rate correlates with 
the city-population. 



B. Taking into account internal flow 

It is a fact that small populations tend to diminish while 
large towns tend to increase their population. We en- 
counter this scenario for most of the 50 provinces of 
Spain. We intend to tackle this issue below. 

We can show that the effect can be described by recourse 
to a smooth dependence of the mean relative growth 
(ii/xi) on log((a:)) that generates what we will call in- 
ternal flow. A second order expansion in log ((a;)) reads 



(x/x) ~ a + 61og((z)) + clog((a;)) 2 



(15) 



where the values of a, b and c come from the correspond- 
ing Taylor coefficients. Assuming i > c we can safely 
write it as 



(x/x) ~ (h) + (k q )i(x)r 



(16) 



4 



where we have defined for convenience (k±) — a — b 2 /2c, 
(k q ) = b 2 /2c and q — 1 = 2c/b. To validate our as- 
sumptions, we fitted the empirical provincial data to Eq. 
(|16[) via (fci), (k q ) and q, when possible (in some cases a 
quasi-linear relation is found, generating large uncertain 
in the optimal values). We have found for the exponent 
q a mean value of 1.2 and a standard deviation of 0.45, 
with | <gr — 1 1 < 1 in all cases. This result confirms the as- 
sumption b c validating the second-order expansion of 
(ii/xi). Moreover, as seen in Fig. [3] (our fourth result), 
nice fits are found in general with very few exceptions. 
With this new hypothesis our complete dynamic equation 
turns out to be 



Xi(t) = k ig (t)[x i (t)] q + k il (t)x i (t) + k n/2 (t)^/x~^ ) (17) 

with ki q (t), kn(t) and kn/ 2 (t) independent (a priori) Wi- 
enner processes. Summing up, we have assumed 

• a finite size term that dominates things for low pop- 
ulation levels (< 1500), 

• a multiplicative term that accounts for population's 
growth/diminution (births, death or o external mi- 
gration, and 

• a power-law (exponent q *~ 1) accounting for inter- 
nal migration. 

Since for most of the population range only one term 
dominates, we will include only one term in the consid- 
erations what follow below. 



V. FROM MICROSCOPIC TO MACROSCOPIC 
DESCRIPTIONS 

We arrive to stage 4, having discussed above a micro- 
scopic population dynamics. We will tray now to ascer- 
tain whether a macroscopic description is also feasible. 
Our goal is to reduce the 2n microscopic degrees of free- 
dom to a few macroscopic ones. We will separately con- 
sider each of the three terms of the dynamic equation. 
The ensuing results will be valid in the domains in which 
each term dominates. 

Consider n random walkers characterized by a dynamic 
coordinate Xi(t) obeying 



ii(t) = ki(t)[xi(t)] 



(18) 



with ((ki(t) — k)(kj(t) — k)) = akSijS(t — t'). Parameter q 
will take as special possible values 1/2 or 1, or in general, 
< q. 



A. Brownian motion and diffusion equation 

We start with q = as control case. One has ii(t) — ki(t) 
so that we deal with the well-known brownian random 
walkers. Consider this numerical procedure: initially, the 



n walkers are located at, say, x — xq. By p(x,t)dx we 
will refer to the walker's normalized histogram, at time t, 
that indicates the walker's relative number positioned in 
the interval dx around x. The associated initial density 
would read p(x, 0) = 5(x — xq). A discrete version of the 
pertinent dynamic equation is 



Xi(t + At) = Xi(t) + Atki(t), 



(19) 



that forces the walkers to "move" during the period At in 
a amount given by Atfci(t), with ki(t) a random number 
generated from a Gaussian distribution determined by an 
standard deviation a k and mean fc, as defined above. We 
have 

M 

Xi(t = MAt) = x + At h[{M - l)At], (20) 

m=l 

so that after M iterations the walkers-distributions coin- 
cides with that of a random number generated by sum- 
ming up M Gaussian numbers characterized by Atak and 
Atk. Remind that a distribution that follows a random 
number composed of two other numbers of that character 
is the convolution of the distributions associated to these 
later numbers. Thus, x(t) is described by the M— th con- 
volution of the fc's Gaussian distribution. By recourse to 
a Fourier transform T for convolutions we have 



T\ P {x,t)\ = J 7 



e -(fc-Atfe) 2 /2(At I r fc ) 2 



-i \ M 



f^Atcjk 



-At 2 a 2 k iu 2 /2+iAtku 



e — "«~ (21) 
and, appealing to the inverse transformation, 

e -(k-MAik) 2 /(2M(At<r fc ) 2 ) 



p(x,t) 



yf2TTMAtcr k 

p -(k-tk) 2 /(4Dt) 



y/AnDt 



(22) 



where we have introduced for convenience 2D = Ata\. 
An arbitrary density p(x,t) will evolve in At, via the 
convolution of that density with a Gaussian of deviation 
Atak = v2AtD and mean Atk, as 

T[p(x, t + At)} = T[ P (x, t)] x e ~^Du 2 +l Aiku (23) 
~ F[p{x, t)\ (1 - AtDu 2 + iAtku) , 

where we take At arbitrarily small. A simple manipula- 
tion involving division by At leads now to 



T[ P (x,t + At)]-T[p(x,t)] 
At 



-Duj 2 + ikuj)T[p(x,t)}. 

(24) 

By recourse to the inverse transformation and taking the 
limit At — > we get 

d tP (x, t) = Dd 2 xP {x, t) - kd xP (x, t), (25) 
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which is a diffusion equation. Accordingly, we reach an 
important result here (our fifth one): 



Our original 2n degrees of freedom-problem can now be 
tackled via just a few macroscopic parameters. 



B. q- metric Brownian motion 

In the general instance q =/= we introduce a variable 
m = log (xi), where log q is Tsallis' g-logarithraSS. The 
Jacobian for the transform is du/dx = l/x q so that u = 
x/x q and the associated dynamical equation becomes 



Ui(t) = ki(t). 



(26) 



In the set {(iti, Wi)}™ =1 , the variables m and iii are inde- 
pendent of each other. We regard them, of course, as our 
dynamical variables. Note that one recovers Brownian 
motion for u. Indeed, 



M 



i(t = M At) = Ui(0) + At h[(M - I) At], (27) 



and then the demonstration of the preceding subsection 
becomes valid, now for u and p{u, t)du. Our new diffusion 
equation reads 



d t p(u, t) = Dd%p(u, t) - kd u p{u, t), 



(28) 



and, starting from a density p(u, 0) = S(u — uq) we end 
up with 



p(u, t)du = 



du 
4nDt 



exp 



(u — Uq — ktY 

4Dt 



(29) 



The x— density is governed accordingly by a glog- normal 
distribution 



px(x,t)dx — p[u(x),t]—du 
du 



(30) 



dx 



■ exp 



(logg(x) - u - ktf 



VAttDIxI 
In particular, for q = 1/2 one has 



4Dt 



px(x, t)dx 



dx 



VAttTHx 



exp 



(2(Vi- 1) -uq- ktf 



4Dt 



and, for q = 1 the well known log-normal 



px(x, t)dx 



dx 



y/InDtx 



exp 



(log(x) - Uq - ktf 



ADt 



(31) 



(32) 



We have again reduced the microscopic number of de- 
grees of freedom to just a few macroscopic parameters. 



C. Examples of diffusion 

Numerical experiments confirm our findings above. We 
start with our dynamical equation in discrete form 



Xi(t + At) = Xi(t) + Atk t (t)[xi(t)] q 



(33) 



using ki(t) — y/2D / At(,i(t) + k, where the random 
numbers £ follow a normal distribution such that 

(&(*)£?(*)) = <M(£ - *')■ We have taken 1 = V 2 and 1 
for our examples, and find that the associated distribu- 
tions exactly follow the diffusion equation's predictions. 
We have used in the former case uq = log 1 / 2 (220), k = 0, 
and <7? = 10, in intervals of At = 0.01. In the later 
instance we had u — log(4400) instead. Indeed, the 
walkers' histograms' evolution follow Eq. (f3~Tj) and Eq. 
(f3"2"| . respectively, with D — Ata\j2 as defined above 
(see Fig. 2] for the cumulative distributions) . 
As empirical examples we discovered that for small pop- 
ulations < 1500 inhabitants the finite-size noise domi- 
nates. Provinces for which most towns are scarcely pop- 
ulated will obey the dynamical equation with q = 1/2. 
Such is the case for the province of, i.e., Salamanca, as 
shown in top panel of Fig. 3) The ensuing dynamics con- 
firms this assertion. The relative growth of most of the 
towns follows a dynamics with a variance s 2 oc \fx (red 
line of the inset). The ensuing distribution fits the final 
state predicted by the diffusion equation for that dynam- 
ics, Eq. ([3l]), with uo+kt = log 1/2 (216.3) and 2Dt = 95.6 
for year 2010 (see Fig. 4). Remark that the 1/2-log- 
normal can be easily confused with the usual log-normal, 
although the former exhibits asymmetries in log-scale. 
As a q — 1— example we mention Florida State in the 
US^it (see also bottom Fig. g]). Using data from 1990, 
2000, and 2010, we have verified that the microscopic 
dynamics confirms the proportional growth assumption 
(with a variance of the relative growth independent of the 
size, as illustrated in the inset). The city-populations dis- 
tribution follows a log-normal distribution, that of Eq. 
(|3"2"1) . which can be the one pertaining to geometrical 
random- walkers' diffusion, with uq + kt = log(4380) and 
2Dt = 2.96. 



D. Constrained diffusion 

q-log-normal distributions do not set any limits to 
population-sizes. However, it is reasonable to as- 
sume that physical space does pose limits to a city's 
population-growth. Unlimited growth is unrealistic since 
in the case of internal migrations the total population 
Nt should remain constant and a free-diffusion model 
is, again, unrealistic. Constrained diffusion must be 
contemplated instead. 

We pass now to consider numerical experiments with 
random walkers that fix lower and upper bounds for pop- 
ulation. These are denoted by xq and Xm, respectively. 
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Now, walkers "moves" leading to values outside the 
range Xq < x < ij; are to be rejected in our simulations. 
Fig. [5] shows that a (/-metric walkers' evolution begins by 
faithfully following the diffusion equation Eq. (|28p till 
they bump off these extreme values. Now their density 
deviates from that of "free" evolution. After some time 
has elapsed, an equilibrium x— distribution is reached 
that follows a power-law with exponent q, independently 
of the initial state. The origin of this systematic result 
can not be unraveled by the simulations, so a higher-level 
of theory is needed. 

Now we use a total population constraint. This is equiv- 
alent to make the walkers move under the rule of a g- 

generalized multi- component logistic equation 



Xi (t) = [x l {t)}'' 



hit) 



£2=! **(*)[*<(*)]« 



£?=!>» 



(34) 



Indeed, it is easy to check that d t Nx = J27=o = 0' 
thus preserving the value of Nt in time. Also the origi- 
nal g-symmetry of the dynamics is preserved. This equa- 
tion is the q-generalization of the scale- invariant multi- 
component logistic equation presented in|35|. Results are 
displayed in Fig.[5]for q = 1, 1.5 and 2, using n = 100000 
walkers and a total population of N = 250000 inhabi- 
tants (with xq — 1). Remarkably enough, equilibrium 
is always reached, to a density that does not depend 
upon the initial state or the fc-parameters. The shape 
of the distributions resembles x power-laws with expo- 
nential cut-off. Again, the simulation can not unravel 
the origin of this form. Finding the properties and the 
exact analytical form of those macroscopic equilibrium 
distributions is our goal in the sext Section. 



VI. THE MACROSCOPIC CONUNDRUM 

We tread now step 5. Our simulations with ran- 
dom walkers suggest that it is indeed possible to pass 
from a description that uses 2n microscopic variables 
to a description involving just a few macroscopic pa- 
rameters. The big question is: do they behave in 
thermodynamic fashion, satisfying the pertinent par- 
tial derivatives-relationships? We wish to tackle this 
issue now looking for a way to reduce the number of 
microscopic degrees of freedom to a few manageable 
macroscopic ones while keeping a coherent, reasonable 
description of our system, mimicking the kind of sce- 
nario that links statistical mechanics to thermodynam- 
ics. This requires appropriate constraints, a topic to be 
addressed below by enumerating the appropriate "social" 
constraints we need. 



A. Macroscopic constraints 

• Total number of cities n. Since there is some confu- 
sion in the available data about what the administrative 



meaning of city is, we wish to ascertain that this issue 
is of no importance. Consider Xi — £™ ! ly, where rii is 
the number of sub-administrative units included in the 



administrative unit i, with 



Xij their sub-administrative 
populations. Considering proportional growth, we write 
for the time-evolution 



Xi{t) = y~]xjj{t) 

3 

= kij(t)xjj(t) 



£ j ' kij (t) x ij (*) \ 



(35) 



where we have defined k'^t) as a new variable defined 
as an average weighted by the populations Xij . If the 
growth rates kij are random variables with approxi- 
mately the same mean and variance, it is easy to check 
that k'i (t) is in turn a random variable of the same mean 
and variance. The dynamical behavior of the ensemble 
of administrative units x is thus equivalent of that of 
the sub-units, and the procedure described in this work 
is still applicable. 



• Maximum/minimum population xm/xq. It is well- 
known that a typical minimum population size equals 
the Dunbar number 40 (~ 150), heuristically associated 
to the maximum (allowable by our neo-cortex) number 
of stable human relationships. Thus, it is reasonable 
to think of a minimum size xq ~ 150. In many cases a 
maximum number for a city population xm can be es- 
tablished via consideration of geographical peculiarities 
as mountains 41 - or oceans^ 2 . (See Fig. [7] for an example). 
In such cases it is convenient to employ the transform 
u = \og q (x/xo). An associated, valuable macroscopic 
parameter is um = \og q (xM: /xq). We will be dealing 
then with a "volume" < u < Um- 



• Total population Nt- We have Nt = £" = i Xi that 
gets transformed into Nt — xo £™ =1 . A useful 
quantity becomes then N — Nt/xq. 



• Total variance of u. With reference to the dynam- 
ics, a useful observable is the total variance for relative 
growth er 2 = £ ™ =1 ((iii — (ui) t ) 2 )/n. For a Gaussian form 
(see Fig. [J) this quantity measures fluctuation-intensities. 
Generalizing, this quantity can be defined by the covari- 
ance matrix with elements = ((iii — (iii))(uj — (Uj))}, 
using its trace as a thermodynamical variable Tr(Q) = 



7 



En 
i=i' 



H-III. Maximum entropy principle (MaxEnt) 



37 



i=l 
T 2 

2 Ha ' 



(36) 



where we add for dimensional convenience a factor r/2. 



B. Fundamental hypothesis for 
urban-thermodynamics 

Let us discuss the pertinent three hypothesis that we 
need in our Scheme: 

• H-I. Microscopic hypothesis. 

We adopt as fundamental dynamical equation Eq. (fT8|) 
[x = kx q ] for the population of a center, linearized via 
the variable u = log 9 (x/xo). We will think of the pair 
(it, it) as constituting our social phase space coordinates. 
We can speak of an: 



H-II. A priori phase space equiprobability in (u, it) 



si 



The probability density distribution for the i— th phase 
space cell centered at (ui,u{) of size dudii is defined as 
p[{{ui, Ui)}™ =l ]d n ud n u. Accordingly to H-II, the system's 
entropy is written as 

s[ P ] = - [ dvu p[{K,^)}r=i]iogb[{(^,ii i )}? =1 ]] . 



(37) 

Since none of our macroscopic observables is able to dis- 
tinguish amongst population nuclei, towns are thus indis- 
tinguishable. In this case, the useful distribution is the 
one-body density p(u, ii) defined as 

P ( U) u) = J (T^ud^u pKfai.uOKU], (38) 
and thus, 

S[p] = — I dudii p(u, it) log [p(u, u)] . (39) 



Macroscopic observables are written in terms of the one- 
body density as 



n = J dudu p(u,u), (40) 
N = J dudu p(u,u)e q (u), (41) 
[7=1/ dudu p(u,u)u 2 . (42) 



Equilibrium is determined via constrained entropic max- 
imization using n, um, N y U. This determines the equi- 
librium density p(u, it) that is a solution of the entropic 
variational problem 



with 



6{S[p]-pA[p}} = 0, 



A = U — pn + pu M + AN, 



(43) 



(44) 



where f3, p, p and A stand for the pertinent Lagrange 
multipliers, that will be seen below to acquire the char- 
acter of intensive thermal-quantities. 



C. Thermodynamical relations 

We enter step 6 by considering the Lagrangian A[p] [and 
Lagrangian density a(u,u)]. It reads 

A[p] = j dudii p(u,u)a(u,u) (45) 

dudu p(u, it) | —u 2 — p + pv(u) + Ae 9 (u)| , 

where the volume condition is enforced by an infinite- well 
potential 



v(u) = { UM / n f or < u < u m; 

^ ' ] oo otherwise 



(46) 



The well-known general solution to the entropic problem 
Eq. (|43|) is 37 p{u,u) — exp[— /3a(u, u)}, so that 

p{u,u) = | e -^ 2 -/3Ae,W ( 0<u <um), (47) 

where the normalization factor Z (partition function) be- 
comes 



Z = 



du / du e-T^MM") 



'— E q (/3A,u M ), 



(48) 



with E q (l,m) the generalized exponential function of or- 
der q 



E q {l,m) = E q {l)-e {1 -^ m E q {le m ). 



(49) 



Our constraints in U and N determine the multipliers /3 
and A-values. On the one hand, we have the ii— variance 



U 



2 

n 

2/3' 



,. /3t _(LLu 2 -2 

du \ —e 2 u 
V 2vr 



(50) 
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and on the other one, we have to deal with the total 
population 



N = n 



du 



e q (u)e-^ Ae "^ 



o E q (j3A,u M ) 
B,-i(M,mm) 



E q {f3A,u M ) 
nF q (pA,u M ), 



(51) 



where we use the function F q (l,m) — di\og[E q {l,m)]. 
We obtain from the former the direct result 




and, via inversion of the latter equation [defin- 
ing first L q (f,m) = F~ 1 (f,m) and thus 
F q [L q (f, m), m] — /], we finally obtain the rela- 
tion between the system variables (equation of state) 



(3A = L q {N/n,u M )- 



(53) 



Note that we have intensive quantities on the left hand 
side, while extensive ones appear in the r.h.s.. The en- 
tropy becomes 



nloe 



1 /2tt 



n V /3t 



E q (/3A,u M ) 



1 



+ pAF q (pA,u M ) 



(54) 



Using now Eq. ([53)) we can recast things in term of the 
natural variables as 



S(U,n,UM,N) — nlog 



E q [L q (N/n,u M ),u M ] 



n 
2 



L q (N/n,u M )N. 



(55) 



It is easy to verify, but crucial to our present goals, 
that macroscopic observables and Lagrange multipliers 
become linked entropic-wise via 



P 



P 
A 



dS_ 

uu n,u M -N 
I ^ 

P dn U,u M ,N 

1 OS 



P duM 
1 dS_ 

P cW 



U,n,N 



(56) 
(57) 
(58) 
(59) 



U.um ,n 



The first relation leads to Eq. (|52j) . also showing 
that the f3— multiplier is the inverse of the variance 
P = 1/rer 2 . The last relation takes us to Eq. (|53"|) and 
is indeed one of our equations of state. The other two are 



P(i = log 



-\—E q [LJN/n,u M ),UM] 
n V tit 



- 1 



-2L^°\N/n,u M ) 



N 



(60) 



and 



Pp 



n exp[-L q (N/n, u M )e q {u M )} 

E q [L q (N/ u,um),um] 
+2L Q °' 1 HN/n,u M )N. 



(61) 



At this stage the reader will agree that it is fair to assert 
that our goal has been successfully reached. We have in- 
deed constructed a social thermodynamics for urban pop- 
ulation flows. The following equivalence may be estab- 
lished vis-a vis the thermodynamics of chemical species: 



• Temperature -h- Variance of relative growth. 

• Number of inhabitants o Number of particles. 

• Number of towns -H> Number of chemical species. 

• Volume o Maximum possible town's population. 



VII. APPLICATION: THE SCALE-FREE IDEAL 
GAS (SFIG) 

This is our final step 7. We envision two main regimes, 
according to the A— value: A = and A > 0. 



A. The SFIG-in-a-box 

We will consider in some detail the first case here. Differ- 
ent scenarios can be associated to A —> 0: i) the system is 
not isolated and exchanges population with its surround- 
ings, with a maximum-size constraint, ii) the triplet n, 
N, um is such that the equation of state yields A = 0, 
i.e., N/n — log q _ 1 [e q (uM)]/'U'M, or hi) no size-limitation 
exists (um — > oo) but N/n is large enough to consider 
A ~ 0. In the latter case one can obtain an effective 
um— value from normalization such that um — E q {0A). 
When A = the Lagrangian A is written as 



A = U — [in + pum j 



(62) 



so that we do not need knowledge of N. The equilibrium 
density is 

p(u,u)dudu — \j — e~~ u dudii (0 < u < Um)- 

UM V 27T 

(63) 
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The partial density p(u) = j dup(u,u) — u/um is con- 
stant in u so that x is given by a power-law 



px(x)dx = p[u(x)] — dx 
ax 



9-1 



dx 



UM X q 

with an associated rank-plot given by 

x(r) = x eg[u M (l - r/ri)], 



(64) 



(65) 



where r is the rank from 1 to n. Comparing with the 
equilibrium densities found above in our numerical ex- 
periments with random walkers, a very nice fit ensues 
as seen in Fig. 5, which validates our methodology. The 
entropy becomes 



S(U, n, um) — n log 



n 
2' 



(66) 



resembling that of the one-dimensional ideal gas. The 
state-equations are 



f3p = log 



and 



um 



(67) 



(68) 



in exact agreement with the ideal gas scenario. 

As empirical q = 1-examples we show the cases of 
i) Marshall Islands^ ii) d'Agosta Valley (Italy)^ and 
iii) Huelva-province (spaing. In all instances the rela- 
tive growth is nearly independent of the population (with 
some finite-size noise) as in the case of Huelva, or with a 
secondary constant trend for low-populated cities, as in 
d Agosta Valley's intance. Thus we consider that the mi- 
croscopic dynamics fits the proportional growth hypoth- 
esis, with densities p(u,u) nicely adapted to the ensuing 
thermodynamic predictions. In all cases, geographical 
conditions set strong limits to the city-sizes. The values 
for the macroscopic parameters are shown in Table [I] Re- 
markably enough, the pressure p due to the limited space 
is highest for the Marshall Islands. Indeed, this system 
exhibits the lowest volume um and the lowest (3 (highest 
"temperature") for a large number of units n. 





Marshall Islands Agosta Valley Huelva 


n 


160 


74 


79 


xo 


2.14 


126 


206 


u M 


0.038 


0.048 


0.059 




0.504 


2.05 


11.7 


P 


0.829 


0.0133 


0.0115 



TABLE I: Macroscopic parameters for the four SFIG-in-a-box 
examples (r = 100 years 2 ). 



B. The SFIG under total population-constraint 

We now consider A > 0, with um ~> oo for simplic- 
ity. This case describes regions where internal migration 
dominates the microscopic dynamics, and no upper limit 
is found for the city-size. According to the equation of 
state Eq. (|6"Tj) . p = in this limit, so that we deal with 
the lagrangian 



A = U - pn + AN. 
The equilibrium density is 



(69) 



p(u, u)dudu 




- 0Ae "^dudu (0 < u), 
(70) 



and the ccuation of state can be written in the form 



N 



gg-iOgA) 
1 E q tfA) ■ 



(71) 



The partial density for x is given by a power-law with 
exponential cut-off 



px [x]dx 



-0Ax 



E q ((3A) 
with an associated rank-plot 

x(r) = ^E- 1 [E q {A)r/n] . 



-dx, 



(72) 



(73) 



Again, this result fits the numerical equilibrium densi- 
ties found above in our numerical simulations (Fig. 6), 
validating again our methodology. 

This is the most common situation in the Spanish 
provinces (for more details see Refj^). We have found 
nice agreement between the q value obtained from a fit to 
the microscopic dynamics and the q— value obtained from 
the fit of the rank-plot to Eq. (f73|) . We show some ex- 
amples in Fig. 121 and the associated macroscopic numer- 
ical results in Table [TTJ In the examples presented below, 
using the parameter A as a measure of the pressure gen- 
erated by the total population constraint, it turns out 
that Alicante is the province with the highest pressure 
and Girona that with the lowest one, correlated with a 
highest and a lowest 'temperature', respectively. 



VIII. CONCLUSIONS 

After initially introducing some useful social-macroscopic 
and social-stochastic quantities we have 

1. Postulated social, dynamic microscopic equations. 

2. Validated them using urban population data. 

3. Performed numerical simulations with random 
walkers that conclusively demonstrated that a de- 
scription using many microscopic variables has as 
a counterpart a macroscopic one with few parame- 
ters. 
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Alicante 


Almen'a 


Girona 


Lleida 


n 


140 


101 


220 


230 




83.9 


147 


141 


105 


N 


18355.7 


3245.53 


4597.37 


2818.12 


log(/3A) 


-6.26 


-5.35 


-5.18 


-4.02 


A 


4.03 10~ 3 


1.83 10~ 4 


7.89 10~ 5 


4.1 10~ 4 




0.47 


25.9 


71.1 


43.8 


q 


0.862 


1.135 


1.27 


1.16 




Navarra 


Vizcaya 


Zaragoza 


Granada 


n 


271 


111 


292 


167 


xo 


47.6 


219 


40.7 


263 


N 


8976.55 


3571.01 


6906.03 


2539.5 


log(M) 


-5.16 


-5.18 


-5.06 


-4.01 


A 


6.69 10^ 4 


1.18 10" 4 


6.22 10" 4 


2.03 10~ 3 


P 


8.57 


47.4 


10.2 


8.91 


q 


1.06 


1.08 


1.18 


1.02 



Showed that such macroscopic description can be 
given an appropriate MaxEnt form after construct- 
ing a "social" phase space, that allows one to derive 
thermodynamic-likc relations amongst our macro- 
parameters. 



5. Finally, as an application, we successfully analyzed 
urban flows as modelled by a scale invariant ideal 
gas. 



TABLE II: Macroscopic parameters for the SFIG-under-pop- 
constraint examples. 
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FIG. 1: Top panel: quantiles from 0.1 to 0.9 each 0.1 for 
Ps(0 as a function of the population x (the median in shown 
in black). Bottom panel: ffe(£)'s cumulative distribution for 
each of Spain's provinces 



12 




10° 10 1 io 2 io 3 io 4 io 5 io 6 io 7 
x (Population) 



FIG. 2: Variance s 2 /x vs. x. Red: quantiles from 0.1 to 
0.9 each 0.1. Solid black: fit to the median value. Dashed 
black lines: Finite-size's fluctuations are constant, while the 
multiplicative regime is given by a straight line. 
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FIG. 3: Fit of (x/x) to an expression of the type represented by Eq. (16) for 12 Spanish provinces: Asturias, Almeria, Caceres, 
Cuenca, Baleares, Lleida, Badajoz, Avila, Guipuzcoa, Castellon, Valladolid and Guadalajara. 
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FIG. 4: Top panel: q — 21/2- metric diffusion at times t = 20 
(green dotted line), 70 (red dashed) and 110 (blue solid) using 
the text-parameters compared with the distribution of Sala- 
manca towns' population in 2010, fitted to a 1/2- log-normal 
distribution (solid line). Inset: variance of the relative growth 
vs. log-population (dots), confirming the yfx dependence for 
q — 1/2 dynamics (red line). Bottom panel: geometric diffu- 
sion (q — 1) at times t = 4 (green dotted), 9 (red dashed) and 
29 (blue solid) compared with the population distribution of 
Florida State (US) in 2010, fitted to a log-normal distribution 
(solid line). Inset: same as top panel's inset, confirming size 
independence and thus proportional dynamics. 




r (x 10000 Rank) 



FIG. 5: (/-metric diffusion with maximum size constraint for (from left to right panels) q — 1/2, 1 and 1.5. Rank-distributions 
and evolution (insets). 




FIG. 6: iV-constrained g-metric diffusion for 9 = 1, 1.5, and 
2. 
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FIG. 7: An example of population restriction arising out of geographical reasons. Left Panel: d'Aosta valley (Italy )— . Right 
Panel: Marshall islands^. 
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FIG. 8: SFIG-in-a-box examples, from top to bottom: RD of 
Huelva-province (Spain), D'Aosta Valley (Italy), and Mar- 
shall Islands (dots), compared with distribution Eq. (|65|) 
(lines). In the insets, the relative growth u vs. the loga- 
rithmic population u. 
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r/n (Normalized Rank) 

FIG. 9: SFIG under total-population constraint examples, in the reading order: RD of Navarra, Lleida, Zaragoza, Girona, 
Granada, Almeria, Alicante, and Vizcaya (dots) and RD of Eq. (|73[) (lines). Insets: relative growth x/x vs. log-population 
log a; fitted to Eq. (16) with the same value of q as in the RD. 



